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(54) HVQ compression with an error term 

(57) Improving the accuracy of a lossy compressor 
(10) by decompressing the compressed input word and 
comparing the result to the original word. An error term 
is produced, and the total compressor output becomes 
the output of the lossy compressor and the error term. 



The compression can be improved by limiting the error 
term to a number of most significant bits, and by loss- 
lessly compressing both terms. An HVQ compressor 
can be used as the lossy compressor. 
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Description 

[0001] When data must be transmitted or stored it is 
usually compressed first, to reduce the transmission 
time or storage requirement. This is especially true of s 
image data which can consist of four color separations 
of 8-bit pixels, 

[0002] One method of compression is HVQ, where a 
block of data can be reduced to a single codeword in a 
number of steps. This method is described in U.S. Pat- 
ent Number 5,602,589, and is incorporated by refer- 
ence herein. This procedure can be explained most 
clearly by the use of a numerical example in connection 
with Figure 1 a of that patent. Assume that a 2 by 4 pixel 
block, 8 bits per pixel, is to be compressed to one 9-bit 
number. First, each pair of pixels is applied to a stage 1 
look up table (LUT) containing 9-bit code words, each 
code word being associated with a two-pixel bit pattern. 
The table is set up so that if an exact match of the input 
pixel bits is not available, then a codeword associated 
with the closest match is output instead. The ultimate 
output of this stage 1 is four codewords, each describing 
the closest match to the bit pattern of the two input pix- 
els. Since the exact match frequently is not possible, 
this compression is inherently lossy to some extent. The 
output of the first stage is four 9-bit codewords. 
[0003] Assuming this amount of compression is not 
sufficient, these four codewords, each representing the 
bit pattern of a 1 by 2 pixel block are next applied to two 
stage 2 tables, resulting in a total of two 9-bit output 
codewords, each representing the bit pattern of a 2 by 2 
pixel block. If the compression is still insufficient, these 
two are applied to the stage 3 table to yield the final out- 
put word, which is associated with a 2 by 4 pixel block. 
It can now be seen that this method is hierarchical in 
that a plurality of stages are used, it involves vectors 
since the input pixels have a directional relationship with 
each other, and is quantized in that a number of levels 
can be implemented to achieve any degree of compres- 
sion. 

[0004] Fordecompression, each codeword is simply 
applied to a 256 K by 64 bit LUT which outputs the bit 
pattern of the entire 8-pixel block. 
[0005] In accordance with one aspect of the present 
invention, a process of compressing an original multi-bit 
word comprises the steps of compressing the original 
word using a lossy compressor to create a compressed 
word which comprises one part of the output, 

decompressing the compressed word to form a 
decompressed word, 

comparing the decompressed word and the original 
word to form an error word which forms the other 
part of the output. 

[0006] In accordance with a second aspect of the 
present invention, a data compression system com- 
prises a lossy compressor for compressing an original 



multi-bit word to create a compressed word which com- 
prises one part of the output, a decompressor for 
decompressing the compressed word to form a decom- 
pressed word; and a comparator for comparing the 
decompressed word and the original word to form an 
error word which forms the other part of the output. 
[0007] An additional element that can be added to the 
general HVQ system to improve its performance is an 
error channel. After compression, the compressor 
10 decompresses the final output codeword to form the 
associated bit pattern, and then compares that to the 
original input pixel block bit pattern. If there was any loss 
in the compression process, then an error can be calcu- 
lated. This error can then be sent along with the com- 
15 pressed output to the decompressor. After the 
decompressor has produced the final bit pattern, the 
error term can be added to it to produce an errorless 
final product. Of course, if the entire error term is sent 
along with the compressed output, the compression 
20 ratio will suffer. The best tradeoff is to send only a few of 
the most significant bits of the error term, perhaps only 
two or three bits. In this way, the final result is improved 
accuracy and/or compression. 

[0008] The performance of a "hierarchical vector 
25 quantization" (HVQ) circuit can thus be improved by 
using the compressor to decompress the data, compare 
the result to the original data to produce an error term, 
and send the error term to the decompressor to correct 
errors in the final decompressed data. 
30 [0009] Some examples of methods and system 
according to the present invention will now be described 
with reference to the accompanying drawings, in which:- 

Figure 1 is an HVQ encoder which has a parallel 
35 error channel; 

Figure 2 is the decoder for the arrangement of Fig- 
ure 1 ; 

Figure 3 is a one-channel arrangement for coding a 
pixel split into its most and least significant bits; 
40 Figure 4 is a one-channel arrangement for decod- 
ing a pixel split into its most and least significant 
bits; 

Figure 5 is a two-channel arrangement for coding a 
pixel split into its most and least significant bits; 
45 Figure 6 is a decoder for the coder of Figure 5; 

Figure 7 shows thee prices of rotating or mirror 
imaging an image using HVQ compression; 
Figure 8 shows a boundary between blocks of dif- 
ferent sizes; and, 
so Figure 9 shows a boundary between blocks of the 
same size. 

[0010] The basic HVQ system can be improved by 
adding an error channel as shown in Figure 1. In the 
55 upper channel the grayscale byte map 16 is applied in 
the usual way to an HVQ decoder 10, the output is loss- 
lessly compressed in an LZ encoder 1 1 , and the result 
is sent to the decoder, usually in the form of 8 to 10-bit 
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words. 

[0011] In addition, the output of the HVQ encoder is 
sent to a decoder 12 in scanline format which produces 
a version of the original byte map which may be different 
from the original because of errors possibly introduced 5 
by the lossy encoder. The two byte maps are then sub- 
tracted 13, pixel by pixel, to produce error terms which, 
if added to the output codeword, will produce the origi- 
nal byte map. This, subtraction can also be done by 
using an exclusive OR, which is simpler and faster and 10 
does not require a sign bit. These error terms, each a 
signed quantity 8-bits wide or less, can then be com- 
pressed in an LZ encoder 1 5 and sent to the decoder in 
parallel with the original output. The larger the error 
term, the less will be the compression ratio. In practice, 15 
small errors are not visually detectable. To limit the deg- 
radation of the compression ratio, the error term can be 
limited to a few most significant bits, three for example, 
in quantizer 14. Normally, the amount of error for a pixel 
will not be large enough to show up in the few MSB's, in 20 
which case there will be no error term at all. 
[0012] The quantized error decoder is shown in Figure 
2. The compressed code words are LZ decoded in 
decoder 21 and HVQ decoded in decoder 22 to produce 
one term for the adder 23. The compressed quantized 25 
errors are LZ decoded in decoder 24 and applied as the 
other term to the adder 23, the output of which is output 
video. The adder 23 can either be an adder adding a 
sign bit and seven . bits, or an exclusive OR if one was 
used to generate the error term in the encoder. 30 
[0013] Figure 3 shows the arrangement when a single 
code word 31 is split into most and least significant 
parts, 32, 33, and where only the least significant bits 
are compressed. In this case bits 0 through 4 are sent 
through lossy compressor 35 while bits 5 through 7 are 35 
not. Both are then compressed using lossless LZ com- 
pression 34,36 and output to the decoder shown in Fig- 
ure 4. Here again, both channels are LZ decompressed 
41,42 while only the LSB's are HVQ decoded 43. The 
two resultant parts are then exclusive ORed 44 together 40 
to be applied to the decoding look up table. 
[0014] Figure 5 is a system where a single pixel is 
separated into a least significant segment and a most 
significant segment, and where a separate and different 
compression process 54,56 is used for each segment, 45 
the least significant bits being more compressed. The 
original pixel is separated into its most significant bits 52 
and least significant bits 53. The result is that the most 
significant bits, being the most important, are less com- 
pressed while compression for the least significant bits so 
has a better compression ratio. A programmable look- 
up table could be used to split the input pixel into any 
two segments other than the 3 - 5 split shown. 
[0015] Figure 6 is the decoder for the encoder of Fig- 
ure 5. The two compressed outputs of Figure 5 are 55 
applied to LZ decoders 61, 62 and HVQ 63, 64 
decoded. Then both are applied to an exclusive OR 
gate 65 to assemble the entire pixel. Of course, if the 



encoder used some other combination of encoders, the 
decoder would use the same form of decoding. That is, 
more generally, the data words in a string can be divided 
into more and less significant bits to create two parallel 
strings, and then compressed using any two methods of 
compression where the greater compression is applied 
to less significant bits. 

[0016] HVQ compression is easily adapted to image 
rotation and mirror imaging as shown in Figure 7. The 
process is shown here using the example of an original 
image that is four pixels high and sixteen pixels wide, 
and must be rotated ninety degrees clockwise and mir- 
ror imaged. 

[0017] Step 1 is the usual compression process of 
reducing the eight 8-pixel segments to eight codewords 
Cw1 to Cw8. Step 2 is the step of rearranging the code- 
words into the rotated and mirror imaged order. This 
hardware can be in the form of wiring where the second 
word in, for example Cw2, is connected to the third word 
out, as shown. Step 2 can have several sets of wiring, 
each set delivering a different rotation. Finally, decoding 
step 3 uses a look-up table to produce a pixel pattern for 
each segment that is properly oriented. Here again, 
several tables can be used to produce various orienta- 
tions. 

[0018] Printing hints may be incorporated into the 
original data supplied by the user in the original page 
description language to indicate to the printer how the 
data may best be printed. For example, a hint word may 
be two bits in length, and indicate one of four possibili- 
ties, that the following data is text, contone, graphics, 
etc. For example, if the printer is receiving data that orig- 
inated as a computer generated graphic it may use a dif- 
ferent halftone screen than it would if the original data 
was screened in from a photograph. 
[0019] Printing hints may be added to any HVQ chan- 
nel as shown in Figure 5. Assume that for each 4 pixel 
block entering into the HVQ encoder 54 there is pro- 
duced one codeword 9 bits in length, contained in two 8- 
bit bytes, so that the first 8 bits are contained in the first 
byte and the last bit is contained in the second byte. 
Then the 2-bit hint is added. Now, each codeword plus 
hint is 11 bits, still contained in two bytes. The LZ 
encoder 55 looks at a string of bytes, perhaps 256 bytes 
in length, and determines the location and size of the 
most recent identical pattern match. To the extent that 
the hint changes once or twice within that string, there 
will be a slightly decreased amount of compression in 
comparison to the amount that would have resulted with 
no hints. However, to the extent that the hint does not 
change during that time, there is no decrease in com- 
pression at all. The result is that printing hints can be 
supplied after an HVQ compressor but before the loss- 
less compressor with very little effect on the compres- 
sion ratio. At the output side, after the code words are 
LZ decompressed, but before being decoded, the hints 
can be extracted for later use. 

[0020] The losses of an HVQ compressor can be fur- 
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ther minimized by choosing codewords and output data 
patterns that have the best chance of matching the 
actual input data patterns. For example, first consider 
text. If text pixels are being encoded in 4 by 2 pixel 
groups, and the four input pixels in one line are black, 5 
dark gray, light gray and white, and the input video was 
scanned-in text, the data most likely originally was a 
boundary between a black letter and a white space, so 
the output pixel pattern could be black, black, white, 
white. On the other hand, if the original input data was a 1 q 
scanned-in computer generated graphic, the four pixels 
are more likely to be a smooth decrease in density from 2. 
black to white. The actual determination of the encoder 
codewords and patterns in the decoder look-up table 
are determined by statistical analysis. A representative 15. 3. 
group of text and graphic documents are passed 
through a test program and the best values are gener- 
ated for each type. 4. 
[0021] A complication arises when a boundary passes 
through an input block of pixels, in which case neither 20 
text nor graphic values can be used for the entire block. 
The solution is to supply a third set of codewords and 5. 
patterns which are generated specifically for this bound- 
ary condition. In this case a set of documents containing 
both text and graphics would be analyzed to produce 25 
one set of patterns, which would be used when a 
boundary, mixed, condition is determined to be within 
the block. 

[0022] The boundary condition is sensed by observing 
the printing hints. For example, a rectangular scanned- 30 
in picture is typically located on a page of text by its x,y 
coordinates. When the raster output scanner beam is 
within the coordinates, the printing hints will indicate to 
the printer which codewords, look-up table entries and 
halftone screens to use. If the hint changes from picture 35 
to text within the block, for example, then the encoder 
knows that a boundary exists within the block. 
[0023] In all cases the block size must be maintained. 
Figure 8 is an example of a transition between text and 
contone. Since different block sizes can be used in HVQ 40 
encoders for different kinds of data, the block size for 
the text is shown here as 2 by 2 pixels to allow greater 
edge detail while the block size for contone is shown as 
being 4 by 2 to allow for greater compression. 
[0024] If a boundary is within a 2 by 2 pixel block, that 45 
block is encoded and decoded using boundary values, 
in addition, any contone pixels to the right of the bound- 
ary such as pixel 81 are also treated as a boundary pix- 
els if necessary so that all remaining pixels to the right 
of the boundary line will be within 4 by 2 pixel blocks. 50 
Similarly, in Figure 9, showing a boundary between 4 by 
2 pixel blocks of text and contone, 2 by 2 pixel boundary 
blocks are used in pairs so that all remaining blocks will 
be 4 by 2. 

55 

Claims 



word comprising the steps of compressing the orig- 
inal word using a lossy compressor (10) to create a 
compressed word which comprises one part of the 
output, 

decompressing the compressed word to form a 
decompressed word, 

comparing the decompressed word and the 
original word to form an error word which forms 
the other part of the output. 

The process of claim 1, further comprising the step 
of losslessly compressing each part of the output. 

The process of claim 1, wherein only the most sig- 
nificant bits of the error word are output. 

The process of any of the preceding claims, 
wherein the lossy compressor (10) is an HVQ com- 
pressor. 

A data compressing system comprising a lossy 
compressor (10) for compressing an original multi- 
bit word to create a compressed word which com- 
prises one part of the output; a decompressor (12) 
for decompressing the compressed word to form a 
decompressed word; and a comparator (13) for 
comparing the decompressed word and the original 
word to form an error word which forms the other 
part of the output. 



1. The process of compressing an original multi-bit 
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(54) HVQ compression with an error term 

(57) Improving the accuracy of a lossy compressor 
(10) by decompressing the compressed input word and 
comparing the result to the original word. An error term 
is produced, and the total compressor output becomes 
the output of the lossy compressor and the error term. 



The compression can be improved by limiting the error 
term to a number of most significant bits, and by loss- 
lessly compressing both terms. An HVQ compressor 
can be used as the lossy compressor. 
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